Psychoacoustically-motivated adaptive β-order generalized spectral subtraction based on data-driven optimization
Abstract
To mitigate the performance limitations caused by the constant spectral order β in traditional spectral subtraction methods, we previously presented an adaptive β-order generalized spectral subtraction (GSS) in which the spectral order β is updated heuristically [10]. In this paper, we propose a psychoacoustically-motivated adaptive β-order GSS that accounts for the fact that different frequency bands contribute different amounts to speech intelligibility (i.e., the band-importance function). Specifically, in the proposed adaptive β-order GSS, the dependence of the spectral order β on the input local signal-to-noise ratio (SNR) is quantitatively approximated by a sigmoid function. This function is derived through a data-driven optimization procedure that minimizes the intelligibility-weighted distance between the desired speech spectrum and its estimate, and its inherent parameters are further optimized with the same procedure. Experimental results indicate that the proposed psychoacoustically-motivated adaptive β-order GSS yields substantial improvements over traditional spectral subtraction methods in terms of intelligibility-weighted measures.
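The idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the sigmoid parameters (`beta_min`, `beta_max`, `slope`, `midpoint_db`), the direction in which β changes with SNR, the over-subtraction factor `alpha`, the spectral floor, and the use of the a posteriori SNR as the "local SNR" are all hypothetical choices made for illustration. In the paper these quantities are obtained by data-driven optimization, and the abstract does not give their values.

```python
import math


def beta_from_snr(snr_db, beta_min=0.5, beta_max=2.0, slope=0.3, midpoint_db=0.0):
    """Map a local SNR in dB to a spectral order beta with a sigmoid.

    All parameter values are placeholders (assumptions), not values from the paper.
    """
    return beta_min + (beta_max - beta_min) / (1.0 + math.exp(-slope * (snr_db - midpoint_db)))


def gss_magnitude(noisy_mag, noise_mag, beta, alpha=1.0, floor=0.01):
    """Beta-order generalized spectral subtraction for one frequency bin.

    Computes (|Y|^beta - alpha * |N|^beta)^(1/beta), with a spectral floor
    (a fraction of the noisy magnitude) so the result never goes negative.
    """
    diff = noisy_mag ** beta - alpha * noise_mag ** beta
    floor_val = (floor * noisy_mag) ** beta
    return max(diff, floor_val) ** (1.0 / beta)


def enhance_frame(noisy_mags, noise_mags):
    """Apply the adaptive beta-order GSS to one frame of magnitude spectra."""
    enhanced = []
    for y, n in zip(noisy_mags, noise_mags):
        # Assumption: the local SNR is taken as the a posteriori SNR |Y|^2 / |N|^2.
        snr_db = 10.0 * math.log10(max(y * y, 1e-12) / max(n * n, 1e-12))
        enhanced.append(gss_magnitude(y, n, beta_from_snr(snr_db)))
    return enhanced
```

With β = 2 the rule reduces to power spectral subtraction and with β = 1 to magnitude spectral subtraction; the sigmoid lets each bin move smoothly between such regimes as its local SNR changes.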
Related works
Adaptive β-order Generalized Spectral Subtraction for Speech Enhancement
The performance degradation of speech communication systems in noisy environments inspired increasing research on speech enhancement and noise reduction. As a well-known single-channel noise reduction technique, spectral subtraction (SS) has widely been used for speech enhancement. However, the spectral order β set in SS is always fixed to some constants, resulting in performance limitation to ...
Noise reduction based on adaptive β-order generalized spectral subtraction for speech enhancement
Though spectral subtraction has widely been used for speech enhancement, the spectral order β set in spectral subtraction is generally fixed to some constants, resulting in the performance limitation to a certain degree. In this paper, we first analyze the performance of the β-order generalized spectral subtraction in terms of the gain function to highlight its dependence on the value of spectr...
A single channel speech enhancement technique exploiting human auditory masking properties
To enhance extremely corrupted speech signals, an Improved Psychoacoustically Motivated Spectral Weighting Rule (IPMSWR) is proposed that controls the predefined residual noise level by a time-frequency dependent parameter. Unlike conventional Psychoacoustically Motivated Spectral Weighting Rules (PMSWR), the level of the residual noise is here varied throughout the enhanced speech based on the ...
Weighted Log-spectral Amplitude Estimation with Generalized Gamma Distribution under Speech Presence Probability
In this paper, we propose a speech enhancement approach based on deriving a weighted log-spectral amplitude estimator that exploits generalized Gamma distributed speech priors under speech presence probability. The log-spectral amplitude estimator is weighted by a psychoacoustically motivated speech distortion measure to take advantage of the perceptual interpretation. The expe...
NOISE REDUCTION USING mel-SCALE SPECTRAL SUBTRACTION WITH PERCEPTUALLY DEFINED SUBTRACTION PARAMETERS-A NEW SCHEME
In the case of colored noise, the noise signal does not affect the speech signal uniformly over the whole spectrum. To deal with speech enhancement in such situations, a new spectral subtraction algorithm is proposed for reducing colored noise in noise-corrupted speech. The spectrum is divided into frequency sub-bands based on a nonlinear multiband Bark scale. For each sub-band, the n...
Publication date: 2008